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Sums of products of polynomials in few variables : lower bounds 

and polynomial identity testing 


Mrinal Kumar* Shubhangi Sara^ 


Abstract 

We study the complexity of representing polynomials as a sum of products of polynomials 
in few variables. More precisely, we study representations of the form 

T d 

i=l3=1 

such that each Qu arbitrary polynomial that depends on at most s variables. 

We prove the following results. 

• Over fields of characteristic zero, for every constant ^ such that 0 < /r < 1, we give an 

explicit family of polynomials {CW}, where Pn is of degree n in iV = variables, such 

that any representation of the above type for P/v with s = requires Td> This 

strengthens a recent result of Kayal and Saha [KS14a] which showed similar lower bounds 
for the model of sums of products of linear forms in few variables. It is known that any 
asymptotic improvement in the exponent of the lower bounds (even for s = ^/n) would 
separate VP and VNP [KS14a) . 

• We obtain a deterministic subexponential time blackbox polynomial identity testing (PIT) 
algorithm for circuits computed by the above model when T and the individual degree of 
each variable in P are at most log*^*-^^ N and s < for any constant p. < 1/2. We 
get quasipolynomial running time when s < log*^*-^^ N. The PIT algorithm is obtained by 
combining our lower bounds with the hardness-randomness tradeoffs developed in [DSY091 
IKI04] . To the best of our knowledge, this is the first nontrivial PIT algorithm for this 
model (even for the case s = 2), and the first nontrivial PIT algorithm obtained from 
lower bounds for small depth circuits. 0 
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in part by NSF grant CCF-1253886. 

^Department of Computer Science and Department of Mathematics, Rutgers University. Email: 
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^In a recent independent work, Forbes |For| does blackbox identity testing for another subclass of depth four 
circuits using shifted partial derivative based methods. To the best of our understanding, the results in these two 
papers are incomparable even though both rely on similar techniques. 
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1 Introduction 


Arithmetic circuits are the most natural model of computation for a wide variety of algebraic 
problems such as matrix multiplication, computing fast fourier transforms etc. The problem 
of proving lower bounds for arithmetic circuits is one of the most fundamental and interesting 
problems in complexity theory. Proving superpolynomial lower bounds for general arithmetic 
circuits would resolve the VP versus VNP conjecture |Val79j . the algebraic analog of the P vs NP 
conjecture. This is one of the holy grails of complexity theory and has received a lot of attention, 
since it is a more structured and potentially easier question to understand and analyse than the 
P vs NP problem . 

The intimately related problem of polynomial identity testing (PIT) is the problem of testing 
if a polynomial, given as an arithmetic circuit is identically zero. In the setting where the 
algorithm cannot look inside the circuit, but only has access to evaluations of the circuit, the 
problem is referred to as blackbox PIT. There is a very simple randomized algorithm for this 
problem - simply evaluate the polynomial at a random point from a large enough domain. With 
very high probability, a nonzero polynomial will have a nonzero evaluation |Sch80[ |Zip79| . It is 
a very important and fundamental question to derandomize the above algorithm. In a seminal 
work, Kabanets and Impagliazzo [KI04] showed that the problem of proving lower bounds for 
arithmetic circuits and the problem of derandomizing identity testing are essentially equivalent! 

These two problems have occupied a central position in complexity theory and despite much 
attention, our understanding of general arithmetic circuits is still very limited. Thus there has 
been a great deal of effort in understanding the complexity of restricted classes of arithmetic 
circuits in an attempt to obtain a better understanding of the general problem. Low depth 
arithmetic circuits in particular are one such well studied class. 

Lower bounds for homogeneous low depth arithmetic circuits. The last few 
years have seen a tremendous amount of exciting progress on the problems of “depth reduction” 
of general arithmetic circuits to low depth arithmetic circuits, and of proving lower bounds for 
low depth arithmetic circuits. Using depth reduction techniques [VSBR,83l IAV081 lKoil21 ITavl.l) 
it was shown that lower bounds (for polynomials in N variables and of degree n) for 

just homogeneous depth 4 arithmetic circuits of bottom fan-in ^/n would suffice to separate VP 
from VNP and imply superpolynomial lower bounds for general arithmetic circuits. At the same 
time there was a very exciting line of works proving lower bounds for the same model of 

arithmetic circuits (and in fact for even the more general class of homogeneous depth 4 arithmetic 
circuits with no restriction on bottom fan-in) [GKKSal IFLMSl IKSSl IKSl IKLSSl IKSI4b) . 

Lower bounds for non-homogeneous low depth arithmetic circuits. Despite 
all this remarkable progress, and some very strong lower bounds for homogeneous low depth 
arithmetic circuits, in the nonhomogenous world much less is understood. Only mild lower 
bounds are known when we drop the condition of homogeneity, even for very simple classes of 
low depth arithmetic circuits. For depth 3 circuits over fields of characteristic 0, only quadratic 
lower bounds known [SWOll |Shp01| , and there has been no progress on this question in more 
than a decade now. 

In a beautiful depth reduction result over fields of characteristic 0, Gupta et al [GKKSbj 
showed that lower bounds (for polynomials in N variables and of degree n) for the class 

of non-homogeneous depth 3 circuits would already separate VP from VNP. It was recently 
observed by Kayal and Saha |KSI4a] d that in fact it suffices to prove such lower bounds for 
depth 3 circuits with bottom fan-in y/n. 

^They attribute the observation to Ramprasad Saptharishi. 
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Till recently (in particular till the work of |KS14a) l. the best known lower bounds for depth 
3 circuits even with bottom fan-in 2 were still just quadratic. In a very nice recent result, 
Kayal and Saha |KS14a) showed an exponential lower bound for depth 3 circuits over helds of 
characteristic 0, whose bottom fan-in is at most N^, where N is the number of variables and 
0 < /r < 1 is an arbitrary constant. More precisely, they prove the following. 

Theorem 1.1 (Kayal-Saha |KS14a) l. Let F be afield of charaeteristic zero. Then, for every 
constant 0 < /i < 1 there is a family {LW} of degree n polynomials in N = variables over 

F in VNP such that any depth three circuit of bottom fan-in at most iV^ computing Pn has top 
fan-in at least . 

Our Model: In this work, we consider the model of sums of products of polynomials in few 
variables. More formally, we consider representations of polynomials P (degree n in TV = 
variables) in the form 

T d 

^=E n Qb (1) 

i=i i=i 

where each Qij is an arbitrary polynomial (of arbitrarily high degree) in at most s variables. 
We call this the model of EH (EII)^*^ circuits. 

Observe that the model is more general than that considered in [KS14a) . The model in 
|KSI4a) corresponds to sums of products of linear forms in few variables. In our case, the Qij 
no longer have to be linear forms, but can be general polynomials of arbitrarily high degree. 
Prior to this work, even for the case when s = 2, there were no nontrivial lower bounds known 
for this model. 

En(En)^*^ circuits for s > 2 can also be seen as a generalization of the model of sums 
of products of univariate polynomials (which corresponds to En(En)^^^ circuits with s = 1), 
which has been very well studied in the arithmetic circuit complexity literature. Lower bounds 
for Eli (Eli) circuits follow from works of Nisan [NisQIj and Saxena |Sax07| . Over the last 
few years, there have been some very nice results giving quasipolynomial time blackbox identity 
testers for En(En)^^^ circuits |FS13al lFS13b[ IXSSI3) . En(En)^'*^ circuits can also be seen as 
a generalization of the widely studied model of diagonal circuits, since polynomials computable 
by diagonal circuits can be represented as a En(En)^^^ circuit without much blow up in the 
size of the representation [SaxO?) . 

Although En(En)^^^ circuits seem fairly well understood from the point of view of lower 
bounds and derandomization of polynomial identity testing, if one considers the model of sums 
of products of bivariate polynomials (EH (EH) circuits), then our understanding changes 
completely. Although only seemingly a mild generalization of En(En)^^^ circuits, the known 
proof techniques for lower bounds for En(En)^^^ circuits (which were proved using evaluation 
dimension techniques of |Nis911 [Raz06) l seem to completely break down in this setting. Thus, 
studying this model seems like an interesting next step towards understanding non-homogeneous 
small depth algebraic computation. As far as we are aware there are also (not surprisingly) no 
nontrivial PIT results for the model. We are now ready to state our results. 

1.1 Our results 

Lower bounds : We show an exponential lower bound for the model of EH (EII)^®^, when 
s is at most for any constant 0 < /r < I {N is the number of variables). More precisely, we 
show the following. 

Theorem 1.2. Let F 6e a field of characteristic zero and p, be any constant such that 0 < yi < 1. 
There exists a family {Pat} of polynomials over F in VNP, where Pn is of degree n in N = 
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variables, such that for any representation of Pjv of the form 

T d 

Pn = Qij 

i=l i=l 

where each Qij is polynomial in at most s = variables, it must be true that 

T-d> 

Given the depth reduction results of |GKKSb] and the observation mentioned earlier from [KS14a] , 
it is known that any asymptotic improvement in the exponent of the lower bound (even for 
s = 0(i/n)) would imply VNP is different from VP. 

As discussed in the introduction, even though this model seems a natural generalization of 
the model of sums of products of univariate polynomials, our lower bound technique is very 
different from those used in proving lower bounds for sums of products of univariates. Our 
lower bound proof is based on ideas developed in the course of investigating homogeneous depth 
four arithmetic circuits |KLSS1 IKS 14b) . 

Blackbox PIT : We also consider the problem of PIT for the model of EH circuits. 

For general sums of products of even bivariate polynomials, this question seems quite difficult, 
and as of now we are not even able to obtain subexponential time PIT. However, as a consequence 
of our lower bounds and by suitably adapting hardness randomness tradeoffs for arithmetic 
circuits developed in [KI04] and [DSY09] , we are able to obtain PIT results in the setting where 
the top fan-in of the circuit is bounded, and when we have the promise that the circuit computes 
a polynomial of low individual degree. 

Our understanding of blackbox PIT for depth four circuits is very limited, and the results 
known are in very restricted settings. Saraf and Volkovich [SVllj gave blackbox PIT algorithms 
for multilinear depth 4 circuits with bounded top fan-in. To the best of our knowledge, the 
idea in |SVII) does not extend to the case of non-multilinear depth 4 circuits, even when the 
individual degree of each of the variables is at most 2. Recently, Oliveira et al |dOSV 14] gave a 
subexponential time blackbox PIT for all depth four multilinear circuitt[^. In the non-multilinear 
setting, Agrawal et al. |ASSSI2] gave PIT algorithms for constant depth formulas in which the 
number of occurences of each variable is bounded. Without going into the technical details, we 
remark that the notion of bounded occur is a generalization of the well studied notion of bounded 
reads. The most closely related results to those in this paper that we are aware of are the recent 
papers of Gupta |GupI4| and Mukhopadhyay [Mukj , which give blackbox PIT results for sums 
of products of low degree polynomials, where the top sum fan-in is bounded and the circuits 
satisfy certain algebraic geometric restrictions. 

So, the question of getting PIT results for general depth four circuits (even with bounded 
top and bottom fan-in) remains wide open. For instance we still do not know any nontrivial PIT 
results for a sum of constant many products of degree 2 polynomials. Though we still don’t know 
how to deal with this question, when we replace the polynomials of low degree with polynomials 
of few variables (but of arbitrarily large degree), then we are able to obtain quasipolynomial 
PIT results. There is one added caveat however, that the final polynomial computed needs 
to be of low individual degree (as seems necessary for PIT results obtained from the known 
hardness-randomness tradeoffs for bounded depth circuits jDSY09]). We now formally state the 
theorem. 

®The running time increases with the size of the circuit, and in particular, it is subexponential time for polynomial 
sized depth four multilinear circuits. 
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Theorem 1.3. Let c and /i he arbitrary constants such that c > 0 and 0 < fj- < and let F 
be a field of characteristic zero. Let C be the set of polynomials P in N variables and individual 
degree at most k over F, with the property that P can be expressed as 

T d 

^=X/ n 

i=i i=i 

such that 

1 . r<log^iV 

2. k < log° N 

3. d<N’^ 

4- each Qij depends on at most variables 

Then, there exists a constant e < 1 dependent only on c and fi, such that there is a hitting set 
of size exp(A^'^) for C which can be constructed in time exp(A^'^). 

Moreover, from our proof, it also follows that if each of polynomial Qij depends only on 
log^^^^ N variables, then both the size of the hitting set and the time to construct it, are upper 
bounded by a quasipolynomial function in N. 

Organisation of the paper: We provide an overview of the proofs in Section [2j We 
describe some definitions and preliminaries in Section |3l We present the proof of the lower 
bound in Section S) We describe the application to blackbox PIT in Section [S] and conclude 
with some open problems in Section |6l 

2 Proof overview 

In this section, we provide an overview of the main ideas in proofs of Theorem 11.21 and Theo¬ 
rem [T|3l 

2.1 Overview of proof of Theorem 11.21 

We restate Theorem 11.21 for the sake of clarity. 

Theorem 11.21 Let F 6e a field of characteristic zero and p, he any constant such that 0 < p < 1. 
There exists a family {Pat} of polynomials over F in VNP, where Pjq is of degree n in N = 
variables, such that for any representation of Pjq of the form 

T d 

Pn = Qij 

i=i i=i 

where each Qij is polynomial in only variables, it must be true that 

T-d> 

The key difference between proving the above lower bound and the lower bounds for ho¬ 
mogeneous depth four circuits is that the formal degree of the circuit in the above case could 
be much larger than the degree of the polynomial, which is n. In fact, even the fan-in of the 
product gates at level 2, that is d could be much larger than n. Therefore, a straightforward 
application of homogeneous depth four circuit lower bounds does not seem to work. Our proof 
is in two steps and at a high level follows the strategy of the lower bound for non-homogeneous 
depth three circuits with bounded bottom fan-in by Kayal and Saha [KS14a] with some key 
differences. 
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• In the first step, we obtain another representation of Pat, as 

pn= Y. 

i=i j=i 

where every monomial in each of the has support at most s, although each 
could now depend on all the variables. The key property that we have gained from this 
transformation is that the fan-in of the product gates at level two is bounded by n now, 
which is the degree of Pn- However, we have no bound on the degree of the Moreover, 
we have blown up the top fan-in a bit, but we will be able to tolerate this loss if s is small. 

• In the second step, the strategy can be seen in two stages. If p. was very small, say 0.001, 
then we could have taken advantage of the fact that in the representation obtained in the 
first step above, the product fan-in is at most n and the support of every monomial in each 
of the Q'j is small, to prove an upper bound on the dimension of the space of projected 
shifted partial derivatives of the above representation. Comparing this dimension with that 
of our hard polynomial gives us our lower bound. For larger values of p, we use random 
restrictions to ensure that all the monomials of large support in are set to zero. At 
the end of such a procedure, we are back to the low support case. This step of the proof 
is closely along the lines of the proof of homogeneous depth four arithmetic circuit lower 
bounds in |KLSS1 IKS 14b) although in the present case, formal degree of the circuit could 
be as large as n^, which is much larger than the degree of the polynomial Pn. For such 
large formal degrees, in general we do not even know lower bounds for non-homogeneous 
depth three circuits. 

We would like to point out that the first step of the proof above is similar to the homogenization 
step in the proof of lower bounds for general depth three circuits with bounded bottom fan-in 
by Kayal and Saha [KS14a) . The key difference is that while the circuit they obtain at the 
end of this step is a strictly homogeneous circuit of formal degree n, we are unable to get a 
similar structure. The complication stems from the fact that when are not affine forms, 
they could contain monomials of varying degrees. In this case, it seems difficult to obtain a 
strict homogenization with a small blow up in size. We get around this deficiency by a more 
subtle analysis in the second step, where we show a lower bound for a circuit which has a formal 
degree much larger than the degree of the polynomial being computed, but has some added 
structure. This step critically uses that the fact that the product fan-in at level two of these 
circuits is at most n, and the support of every monomial in each of the is small. 

2.2 Overview of proof of Theorem 11.31 

We first restate Theorem 11.31 

Theorem 11.31 Let c and p be arbitrary constants such that c > 0 and 0 < p < 1/2, and let F 
be a field of characteristic zero. LetC be the set of polynomials P in N variables and individual 
degree at most k over F, with the property that P can be expressed as 

T d 

H ^ Qij 
i=i i=i 


such that 

1 . r<log°iV 

2. k < log'^ N 

monomial is said to have support support s if it depends on at most s distinct variables. 
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3. d<N^ 

4- each Qij depends on at most variables 

Then, there exists a constant e < 1 dependent only on c and fi, such that there is a hitting set 
of size exp(A^'^) for C which can be constructed in time exp(A^'^). 

The construction of the hitting set is based on the well known idea of using hard functions 
for derandomization. Our goal is to reduce the number of variables from N to at most 
for some constant <5 < 1, while maintaining the zeroness/nonzeroness of the polynomial be¬ 
ing tested [KI041IDSY09] . Once we have done this, we take a brute force hitting set of size 
(Degree -|- as given by Lemma [5.51 To reduce the number of variables, we 

use the framework introduced by Kabanets and Impagliazzo |KI04] . 

The key technical step of the proof is to show that for a non-zero polynomial P as defined 
above, if there exists a polynomial / G F[Xi, Y 2 ,..., Xi-i, Xi+i, Xi+ 2 , • ■ ■, Xn] such that Xi — f 
divides P, then / can also be expressed as a sum of products of polynomials in few variables 
of reasonably small size. This step crucially uses a statement about complexity of roots of 
polynomials computed by low depth circuits from [DSY09) . Therefore, if / is a polynomial 
which does not have a small representation as a sum of products of polynomials in few variables, 
then Xi — f does not divide P. This observation guarantees that the construction of hitting 
sets from hard polynomials given by [KI04) works for this class of circuits. 

3 Notation and Preliminaries 

We now introduce some notation and preliminary notions that we use in the rest of the paper. 

Computational model : In this work, we consider the model of sums of products of 
polynomials in few variables. More formally, we consider representations of polynomials P 
(degree n in iV = variables) in the form 

T d 

^ = (2) 
i=l i=l 

where each Qij is an arbitrary polynomial (of arbitrarily high degree) in at most s variables and 
each ai is a field constant. We call this the model of Eli (EII)^®^ circuits. We use the quantity 
Td as a measure of the size of a Eli (EII)^®^ circuit. Without loss of generality, we can assume 
that the degree zero term in each of the Qij is either zero or one. If it is a non-zero constant 
other than I, we can extract it out and absorb it in ai. For each of the product gates, the fan-in 
could be different, but we can assume without loss of generality that all the product fan-ins 
are equal to d. Observe that the d could be much larger than the degree of the polynomial P. 
Throughout this paper, we will be working over a field of characteristic zero. 

Some basic notations : 

1. For an integer i, we denote the set {1,2,..., i} by [*]. 

2. By X, we mean the set of variables {Xi, X 2 ,..., Yw}. 

3. For a polynomial P and a positive integer i, we represent by Hom*[P], the homogeneous 
component of P of degree equal to i. By Hom-*[P] and Hom-*[P], we represent the 
component of P of degree at most i and at least i respectively. 

4. The support of a monomial a is the set of variables which appear with a non-zero exponent 
in a. We denote the size of the support of a by Supp(a). 
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5. Throughout the paper, we say that a function f{N) is subexponential in N if there exists 
a positive real number e, such that e < 1 and for all N sufficiently large, f{N) < exp{N’^). 

6 . We say that a function f{N) is quasipolynomial in N if there exists a positive absolute 
constant c, such that for all N sufficiently large, f{N) < exp(log'^ A^). 

7. In this paper, we only consider layered arithmetic circuits and we will be counting levels 
from top to bottom, starting with the output gates being at level one. 

8 . By a EIISA circuit, we refer to a depth four circuit with all the product gates at the lowest 
level being replaced by powering (A) gates. Similarly, by a EIIE A EH circuit, we mean a 
depth six circuit all of whose product gates at level four from the top are powering gates. 

Hitting set : Let C be a set of polynomials in N variables over a field F. Then, a set 

ncF^ is said to be a hitting set for the class C, if for every polynomial P G C such that P is 
not the identically zero polynomial, there exists a. p €% such that P[p) ^ 0. 

Elementary symmetric polynomials : For variables X = {Xi,X 2 , ■ ■ ■, X^} and any 
integer 0 < I < N, the elementary symmetric polynomial of degree I on variables X is defined 
as 

ESYMi(X)= Y. n^^- 

sc[N],\s\=ijes 

Projected shifted partial derivatives : A key idea behind the recent progress on lower 
bounds is the notion of shifted partial derivatives introduced in |Kayl2| . In this paper, we 
use a variant of the measure, called projected shifted partial derivatives introduced in [KLSS] 
and subsequently used in [KS14b) . Although we never explicitly do any calculations with the 
measure in this paper, we provide a brief introduction to it below since the bounds are based 
on it. 

For a polynomial P and a monomial 7 , dj{P) is the partial derivative of P with respect to 
7 . For every polynomial P and a set of monomials Ai, d>t(P) is the set of partial derivatives of 
P with respect to monomials in Ai. The space of (AI,m)-projected shifted partial derivatives 
of a polynomial P is defined below. 

Definition 3.1 ((AI,m)-projected shifted partial derivatives). For an N variate polynomial 
P G F[Ai, A 2 ,... iXjsi], set of monomials Ai and a positive integer m > 0, the space of{A4,m)- 
projected shifted partial derivatives of P is defined as 

{dM{P))m '"= ¥-span{a{\{x, ■ 9) ■ 9 ^ dM{P),SC [A], |^| = m} (3) 

ieS 

Here, cr(P) of a polynomial P is the projection of P on the multilinear monomials in its 
support. The measure of complexity of a polynomial that we use in this paper, is the dimension 
of projected shifted partial derivative space of P with respect to some set of monomials AI and 
a parameter m. Formally, 

^M.miP) = Dim((a7Vt(T’))m) 

From the definitions, it is straight forward to see that the measure is subadditive. 

Lemma 3.2 (Sub-additivity). Let P and Q be any two multivariate polynomials in F[Ai, X 2 , ■ ■ ■, A^v]. 
Let Ai be any set of monomials and m be any positive integer. Then, for all scalars a and /3 


■ P + p ■ Q) < ^M,m.{P) + ^M,m.{Q) 








Approximations : We will refer to the following lemma to approximate expressions during 
our calculations. 


Lemma 3.3 l [GKKSa] l. Let a{n), f(n), g{n) : Z>o —>■ Z>o be integer valued functions such 
that if + g) = o(a). Then, 


log 


(« + /)! 
ia-g)'. 


if + g) \oga ± O ^ 


In the proofs in this paper, we use Lemma 13.31 only in situations where (/ + g)'^ will be 0(a). 
In this case, the error term will be bounded by an absolute constant. So, up to constant factors. 
We use the symbol ~ to indicate equality up to constant factors. 


Complexity of coefficients and homogeneous components : We now summarise 
two simple lemmas which are useful for our proof. The first lemma summarises that given a 
circuit C for a polynomial P €'¥[X1,X2,... of degree at most d, for every 0 < i < d, 

the coefficient of F® in P (when viewing P as a polynomial in F[Ai, A 2 ,... ,X 7 v][F]) can also 
be computed by a circuit of size not much larger than the size of C. 

Lemma 3.4. Let P G F[Ai, X 2 ,..., Xn, Y] be a polynomial of degree at most d in Y over a 
field F of characteristic zero, such that P is computable by an arithmetic circuit C of size \C\. 
Let 

d 

P = ^Q,(Ai,A2 ,...,Xiv)-F® 
i=0 

for polynomials QiiXi, X 2 ,..., Xj^) G F[Ai, A 2 ,..., A^y]. Then, for every i such that 0 < i < d, 
the polynomial Qi can be computed by an arithmetic circuit C' of size at most \C\ ■ (d + I). 
Moreover, if the output gate of C is a gate, then the depth of C is equal to the depth of C. 
Else, the depth of C' is at most I more than the depth of C. 

Proof. We can view P as a, univariate polynomial of degree at most d in F with the coefficients 
coming from F(A). From the classical Lagrange interpolation, we know that the coefficient of 
F® in P can be written as an F(X) linear combination of the evaluations of P at d + I distinct 
values of F taken from F(A). In fact, more strongly, we can evaluate P at d + I values of F 
all chosen from F itself, in which case the constants in the linear combination are also from F. 
So, Qi can be computed by a circuit obtained from taking d + 1 circuits each obtained from P 
by substituting F by a scalar in F, and taking their linear combination. Let this circuit be C. 
Clearly the size of C is at most (d + 1) times the size of C. If the output gate of C was an 
addition gate, then the outer addition for the linear combination can be absorbed into it, and 
the depth remains the same. Else, the depth increases by one. □ 

The second lemma stated below essentially says that the circuit complexity of homogeneous 
components of a polynomial is not much larger than the circuit complexity of the polynomial 
itself. 

Lemma 3.5. Let P be a polynomial of degree at most d in N variables over a field F of 
characteristic zero, such that P is computable by an arithmetic circuit C of size \C\. Then, for 
every i such that 0 < i < d, the homogeneous component of degree i of P can be computed by 
an arithmetic circuit C of .size at most \C\ ■ (d + 1). Moreover, if the output gate of C is a + 
gate, then the depth of C is equal to the depth of C. Else, the depth of C is at most I more 
than the depth of C. 
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Proof. Let P'(t) be the polynomial obtained from P by replacing every variable X in P by 
X ■ t for a new variable t. We can view P' to be a univariate polynomial of degree at most d 
in t with the coefficients coming from F(X). Observe that for every i such that 0 < * < d, the 
homogeneous component of P of degree equal to i is equal to the coefficient of P in P'. The 
proof now follows from Lemma 13.41 □ 


4 Proof of the lower bound 

In this section, we give the proof of Theorem 11.21 We prove the lower bound for a variant of 
the well known family of Nisan-Wigderson polynomials defined by Kayal and Saha [KS14a) . 

4.1 Target polynomials for the lower bound 

We now define the family of polynomials of degree n in N variables for which we prove the lower 
bounds. The family is a variant of the Nisan-Wigderson polynomials which were introduced by 
Kayal et al in |KSS] in the context of lower bounds for homogeneous depth four circuits. The 
particular variant we use in the paper is due to Kayal and Saha |KS14aj . 

The tradeoff between the number of variables N and the degree n will be parameterized by 
the parameter /i where 0 < fj. < 1. First we need some parameters, which we define below. 

1 . d=(l — /r)/2isa positive real number such that + 6 < 1. 

2 . 

' l—fl — d 

3. N is chosen such that N/n is a prime number between and 2n^“'"'’'. Such a prime 

number always exists from the Bertrand-Chebychev theorem. Without loss of generality, 
we pick the smallest one. 

5- D = 2 {i!’'y) ■ ^ ’ where D — 1 is the degree of the underlying univariate polynomials in the 
definition of NWn,fi. 

Let ip be the prime number equalling N/n. We are now ready to restate the definition of NWn.^i 
from [KS14a| . 

Definition 4.1 (Nisan-Wigderson Polynomials |KS14a) L Let ^ be a real number such that 
0 < fi < 1. For a given and n, let N, D, ip be as defined above. For the set of N variables 
{Xij : i € [n], j € [ip]}, we define the degree n homogeneous polynomial NWn,^ as 

NWn., = E n 

/(z)GF,/,[z] iGH 
deg(f)<D-l 

From the definition, we can observe the following properties of NWn.g,- 

1. The number of monomials in NWn.g is exactly ip^ = 

2. Each of the monomials in NWn,fj, is multilinear. 

3. Each monomial corresponds to evaluations of a univariate polynomial of degree at most 
D — 1 at all points of Fy,. Thus, any two distinct monomials agree in at most D — 1 
variables in their support. 

We will also need the following lemma in our proof. 

Lemma 4.2. Let g. be a non-negative real number less than 1. Given q £ F'^, p., n, we ean 
evaluate the polynomial NWn.g. at q in time . 
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Proof. Given n and we first find if as given by the choice of parameters. Once we have D, 
we iterate through every monomial a of degree n in the X variables which is supported on all 
the rows of the variable matrix and check if it is in the polynomial NWn,ii by trying to find a 
univariate polynomial f{z) € F^[z] such that degree of / is at most D — 1 and Oiefn] 

The interpolation takes only Poly(n) time, and the total number of monomials to try is at most 
TV". So, we get the lemma. □ 

We now proceed with the proof as outlined in Section 12.11 


4.2 Reducing the product fan-in at level two 

Let P be a homogeneous polynomial in N variables of degree n which has a Eli (EII)^®^ circuit 
of top fan-in T and product fan-in d at the second level. In other words, there exist polynomials 
{Qij : i £ [T],j £ [d]} in at most s variables each, such that 

T d 

^ = (4) 

i=l j=l 

Recall that without loss of generality, we can assume that the constant term in each of the Qij 
is either 0 or 1. We have the following lemma. 

Lemma 4.3. Let ¥ be a field of characteristic zero. Let P be a homogeneous polynomial of 
degree n in N variables over F as defined above. For each i, 1 < i < T define the set 

Si = {j : 1 < j < d and Hom°[(5y] = 1} 


Then, 


T 

P — ai ■ Horn"' 

i=l 


n Q^J X ^ ESYMi({Hom^i[Qy] : j £ S,}) 

j^Si 1=0 


(5) 


Proof. To prove the lemma, we will try to extract out the homogeneous part of degree n of 
each product gate Yl‘j=iQij- Together with the fact that the polynomial P is homogeneous 
of degree n, we get the lemma. Every Qij with a non-zero constant term can be written as 
Hom-^[(5y] + 1, since the constant term in each Qij is either 0 or 1. Now, 


d 

n Qij = II ‘5b X H (Hom-^[( 3 y] -b 1 ) ( 6 ) 

1=1 j^Si j&Si 

Decomposing the product OjeSi + 1) further, we have 

is.l 

H (Hom-^[Qy]-b 1] = ^ E n(7) 

j&Si 1=0 UCSi-.\U\=l j&U 

Now, observe that the degree of every monomial in Yij^u Houn-^[(5y] is at least as large as the 
size of U. So, for every subset U of size larger than n, Y\j^u Hom-^[(3y] is a polynomial of 
degree strictly larger than n. Also, for any fixed I, the expression '^ucSi-\u\=i Ojec/ 
is precisely the elementary symmetric polynomial of degree I in the set of variables {Hom-^[( 5 ij] : 
j £ Si}. Therefore, 


Horn-" 

H (Hom-^[Qy] -b 1 ) 

= Horn-" 

' n 

^ESYMz({Hom^i[Q„] : j£50) 


_j€Si 


J=0 
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Therefore, 


Horn" 


1 

c5 

1 _ 

= Horn” 

n 

n Q^3 X ^ ESYMi({Hom^i[Qy] : j G 



jiSi 1=0 


Summing up for all i, we get the lemma. 


(9) 

□ 


The lemma above has in some sense helped us locate the monomials of degree n in the 
circuit, which otherwise has a much higher formal degree. We now combine the above lemma 
with the well known fact that elementary symmetric polynomial of degree I in k variables can be 
computed by homogeneous EIISA circuits of size at most to obtain a SnS A Eli circut 

C' such that the fan-in of the product gates at level two is at most n. We use the following 
theorem (Theorem 5.2) by Shpilka and Wigderson [SWOl) . 

Theorem 4.4 (Shpilka-Wigderson |SW01) i. For every set of variables {1^,12, ■ • ■, dm} and a 
positive integer I, ESYMi({li, y 2 , ■ • ■ ,Ym}) can be computed by a homogeneous EIIEA cireuit of 
size m20(^). 

We now prove the following lemma. 

Lemma 4.5. Let ¥ be a field of characteristic zero. Let P be a polynomial of degree n in N 
variables over F whieh is computable by an EH (EII)^^^ circuit C of top fan-in T and the degree 
of product gates at level two being d. So, P can be represented as 

T d 

P = cxi ■ Qij 

i=l 0=1 


Then, P can be represented as the homogeneous component of degree n of a polynomial computed 
by a EIIE A EH circuit C" with the following properties : 

1. The inputs to the A gates are the polynomials '■ 1 < i < T,1 < j < d} 

2. The fan-in of the x gates at the second level from the top is at most n 

3. The top fan-in of C” is at most Tdn2^^x^\ 


Proof. From Lemma lT3l we know that for the set Si defined as 


Si = {j : I < j < d and Hom°[Qy] = 1} 
the polynomial P can be written as 

T 


P — ai ■ Horn" 


i=l 

which is the same as 

P = Horn” 


n Q^J X ^ ESYMi({Hom^i[Qy] : j S 

jdSi 1=0 


• n Qb X ^ ESYMi({Hom^i[Qy] : j S 

i=l jdSi 1=0 


Observe that the polynomial nj^ 5 . Qij has degree at least d— 15^1. We remark that if d— |S'i| is 
larger than n, then such product gates do not contribute anything to the degree n component of 
the polynomial and hence can be discarded without loss of generality; hence we assume n — {d — 
|S'i|) > 0. So, we could confine the inner sum from I = 0 to I = n — {d— |5'i|), and still preserve 
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the degree n part of the polynomial, which is what we are interested in. From Theorem 14.41 we 
know that for every 0 < I < n, we can compute the polynomial ESYM;({Hom-^[(5y] : j S Si}) 
by a EIISlA circuit of top fan-in at most d x which takes as input the polynomials 

{Hom-^(Qij) : 1 < j < d}. From the homogeneity of the circuits given by Theorem 14.41 it 
follows that the product gates at level two of these circuits have fan-in at most the degree of 
polynomial they compute, which is at most n — (d — |5'i|). So, it follows that the polynomial 

/ T n-(d-ISil) 

p = • n Qb X E ESYMi({Hom^i[g,,] : j G 5,}) 

\i=l j^Si 1=0 

can be computed by a SnS A Eli circuit, with top fan-in at most Tdn ■ which satisfies 

the conditions in the lemma. □ 

Finally, given the circuit C" constructed above, we can construct a circuit which computes 
the polynomial P as given by Lemma l3.51 For this, observe that the monomials of degree strictly 
larger than n in any of the Qij do not contribute to degree n part of P. So, we can drop them, 
while still preserving the degree n part of P. Therefore, the degree of P can be upper bounded 
by ri^d. We can recover the degree n part of P by interpolation which blows up the top fan-in 
by a factor of at most n^d. 

In this process, the fan-in of the product gates at level two remains unchanged. Strictly 
speaking, inputs to the powering gate A at level four may no longer be the polynomials Hom-^ , 
since in the process of interpolation, we replaced every variable Xi by Xi.t in P and looked at 
the resulting polynomial P' as a univariate polynomial in t over the function field F(X). We 
then evaluated P' at sufficiently many values of t G F and then took their F linear combination. 
So, each of the polynomials Hom-^[Qy] gives rise to many other polynomials, one each for 
different values of t. We will call them the siblings of Hom-^[Qy]. The key observation for our 
proof is that the set of variables in the siblings of Hom-^[Qy] is the same as the set of variables 
in From the lemma and the discussion above, we have the following corollary. 

Corollary 4.6. Let ¥ be a field of characteristic zero. Let P be a polynomial of degree n in N 
variables over F which is computable by an EH (EII)^*^ circuit C of top fan-in T and the degree 
of product gates at level two being d. So, P can he represented as 

T d 

P = cni ■ Qij 

i=l j=l 

Then, P can be computed by a EIIE A Eli circuit C" with the following properties : 

1. The inputs to the A gates are the siblings of polynomials {Hom-^[Qy] I<j< 

d} 

2. The fan-in of the x gates at the second level from the top is at most n 

3. The top fan-in of C” is at most Td^n^2^^'^\ 

4.3 Random Restrictions 

From the definition, it follows that the total number of variables in NWn,)j, is N. Let the 
set of all these variables be V. We now define our random restriction procedure by defining 
a distribution 1) over subsets V C V. The random restriction procedure will sample V V 
and then keep only those variables “alive” that come from V and set the rest to zero. We will 
denote the restriction of the polynomial obtained by such a restriction as NWn^ifiv- Observe 
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that a random restriction also results in a distribution over all circuits computing the polynomial 
NWn,fi- We denote by C\v the restriction of a circuit C obtained by setting every input gate 
in C which is labelled by a variable outside y to 0. 

The distribution Vp-. Each variable in V is independently kept alive with a probability p. 
We will choose the value of p based on the parameter p. 

4.4 Analysing the circuit under random restrictions 

Let C be a Eli (EII)^^ ^ circuit computing the polynomial NWn^p- Let the top fan-in of C be 
T and the product fan-in at the second level be d. So, we have the following expression. 

T d 

NWn,p =Y.^ I[ Qb 

i=l j=l 

where each Qij depends on at most variables. 

Recall that from the choice of parameters 5 = (1 — p)/2. Let s be a parameter, which we 
later set such that s = 0(i/n). If T • d > Ni^, then we already have the desired lower bound 
of on the size of C and we are done. Therefore, for the rest of this discussion, we will 

assume that T ■ d < . We now apply the transformation to C given by Corollary 14.61 to 

obtain a ERE A EH circuit C", which has the following properties: 

1. The inputs to the A gates are the siblings of polynomials {Hom-^[(5y] : 1 < * < T, 1 < 
j < d} 

2. The fan-in of the x gates at the second level from the top is at most n 

3. The top fan-in of C" is at most 

We now analyse the effect of the random restrictions on the circuit C”. We will choose a 
parameter p = N~^~^ and keep every variable alive with a probability p. The circuit C" can 
be represented as 

U V 

Here, each Duv is a sum of powers of the siblings of Hom-^[(5ij]. Our goal is to argue that 
under random restrictions, all the monomials in each of the D^v are of small support (support 
at most s). 

For any polynomial P in variables and any integers t, to such that to < t, observe that 
P* can be written as 


P* = Po + ^ a • 

OL 

where Pq is the part of P consisting of monomials of support strictly less than to- The inner 
sum is over all multilinear monomials a of support equal to to- Such a decomposition may not 
be unique, but for this application, it would suffice to work with any one such decomposition. 
The number of such monomials a is at most - The probability that one such monomial 
survives the random restriction procedure is equal to So, the expected number of such 
multilinear monomials a surviving the random restriction procedure is at most . The 

crucial observation is that if no such monomials survive, then only the monomials in Pq survive, 
all of which have support at most to ~ 1- 

Now, observe that each of the Duv are a sum of powers of the siblings of polynomials in the 
set j] : 1 < i < T, I < j < d}. Define B to be the set of all multilinear monomials 

of support equal to s, supported entirely on variables in any of the polynomials Qij for some 
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1 < i < T,1 < j < d. From the discussion in the paragraph above, the following observation 
follows. 

Observation 4.7. Let the polynomials Duv, Qij and the set B he as defined above. Then, 

• \B\<T-d- 

• If none of the monomials in B survive under some random restrictions, then each of the 
polynomials obtained as a restriction of Dy^y has all monomials of support at most s. 


Proof. The bound on the size trivially follows from the fact that each of the Qij depends on at 
most variables. For the second item, observe that each of the Dyy is a sum of powers of 
siblings of the Hom“^[Qy] and all the siblings are supported on the same set of variables. If all 
the monomials in the set B are set to zero, then the surviving monomials in any power of any 
of the siblings of Hom-^[(5y] has support at most s. □ 


We now estimate the probability that at least one of the monomials in the set B survives 
the random restriction procedure. We have the following lemma. 

Lemma 4.8. Let 6 be a positive real number such that 6 = (1 — ^)/2 and let p = N~^~^. Then 

Prv^Vy [\B\v\ > 1] < IV-3/4-^- 


Proof. We know that 

|e|<r.d.(";) 

and the probability that any fixed monomial in B survives the random restriction procedure is 
at most p®. So 

^v^Vy[\Bv\\ <T ■ d - i ^ ] ■ p^ 

Now, observing that the value of T • d is at most Ni^ and p = , the expected value is at 

most 

. JV-(m+<5)s ^ ji^-3/4.S.s 

The lemma then follows by Markov’s inequality. □ 

As a corollary of Lemma 14.81 and Observation 14.71 we get the following lemma. 

Lemma 4.9. Let 6 be a positive real number sueh that d = (1 — /i)/2 and let p = N~^~^. 
Then with probability at least 1 — 7V-3/4-5-S over random restrictions V •<— Dp, the polynomial 

computed by the circuit C''\v can he written as Y^^=iY[v=i ^'uv ^ where each of the monomials 
in each of the polynomials D^y has support at most s. 

4.5 Upper bound on the complexity of C 

In order to upper bound the dimension of the projected shifted partial derivatives (under random 
restrictions) of the circuit C, Corollary 14.61 implies that it suffices to upper bound 

the dimension of the space of projected shifted partial derivatives of the SIIS A EH circuit C” 
given by Corollary 14.61 In some sense, C” is more structured than C and this lets us prove a 
better upper bound. 

Recall that we are under the assumption that for the circuit C, the product of the top fan-in 
and the product fan-in at level two is at most N*'^, else we are already done. From Lemma IT^ 
we know that with a high probability, under random restrictions, we are left with a circuit of the 
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form ELiOL -DE where each of the monomials in each of the polynomials has support 
at most s. The upper bound on the complexity of the projected shifted partial derivatives of 
eLiOE then just follows from the upper bound for homogeneous depth four circuits of 
bounded bottom support proved in |KLSS1 IKS 14b) . We restate the bound from [KS14bj . 

Lemma 4.10. Let C be a depth 4 cireuit with the fan-in or product gates at level two bounded 
by n, the bottom support bounded by s and computing a polynomial in N variables. Let M. be a 
set of monomials of degree equal to r and let m be a positive integer. Then, 

^M,m{C) < Top fan-in{C) ^ \ 

\ r J \m + rsy 

for any choice of m, r, s, N satisfying m + rs < N/2. 

\N^] 

The upper bound for EH (EH) ' circuits, follows easily form the above lemma after random 
restrictions, and we formalize this in the lemma below. 

Lemma 4.11. Let p be a positive real number such that 0 < p < 1. Let 6 = {1 — p)/2 and let 
p = N~^~^ and let ¥ be a field of characteristic zero. Let P be a polynomial of degree n in N 
variables over F which is computed by an EH (EII)^'^ ^ circuit C of top fan-in T and degree of 
product gates at level two at most d, i.e P can represented as 

T d 

P = oii ■ Qij 

i=l j=l 

where are field constants. Let m and r be positive integers satisfying m -\- rs < N/2 and M 
be any subset of multilinear monomials of degree equal to r. IfTd< , then with probability 
at least 1 — jV-3/4-<5-s over random restrictions V •<— Pp, 

\m-\-rsJ \ r J 

Proof. The lemma follows immediately from Corollary 14.61 Lemma 14.91 and Lemma 14.101 □ 

4.6 Nisan-Wigderson polynomial under random restrictions 

To complete the proof of Theorem 1 1.21 we need a lower bound on the dimension of the space of 
projected shifted partial derivatives of the polynomial NWn,^, under random restrictions. To 
this end, we will use the lower bound proved by Kayal and Saha [KS14a] . We first enumerate 
our choice of parameters. Recall that d = (1 — /i)/2 is a positive real number. 

1 ..y 2(;j+(5) + l 

2. N is such that N/n is set equal to the smallest prime number between and 2 ?t,^+'’'. 

3. = + 

‘i. D — 2 {-^+-y) ■ ^ ’ where H — 1 is the degree of the underlying univariate polynomials in the 
definition of NWn,fi. 

5. r, s which are the order of derivative and the bound on bottom support of the circuit after 
random restrictions respectively, are chosen such that r = ei • ^/n, s = €2 ■ \/n. Here, ei 
and €2 are small enough positive real numbers satisfying ei • £2 = O.OOln. 

6 . m = y(1 ~ is the degree of the shifts. 

7. p = is the probability with which each variable is independently kept alive. 
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8 . Ai is the set of all multilinear monomials of degree r. We take partial derivatives with 
respect to monomials in this set. 

We are now ready to state the lower bound on the dimension of projected shifted partial deriva¬ 
tives as in [KS14a] . 

Lemma 4.12 (Kayal-Saha [KSllaj l. Let NWn,^ be Nisan-Wigderson polynomials as defined 
in Definition \4-l\ Let F be any field of characteristic zero. Then, for the choice of parameters 
defined above 


^M,m{NWn,p,\v) ^ V 4^ 


N' 


N 

m + n — r 


with probability at least 1 — over random restrictions V •4— Dp. 


4.7 Wrapping up the proof of Theorem 11.21 


From Lemma 14.121 and Lemma 14.91 we know that with a non-zero probability over the random 
restrictions V from the distribution Dp, the following two conditions hold. 

1 . 

1 fN\ /N\ ( N 

>-^min ( — 


2 . 


^M,m{C\v) < Td^n^ ■ rs ■ ■ ( 

\ 

If C computed the polynomial NWn.pi., then 


m) \m + n — r 


N \ fn + r 
m + rs I V r 


Tdfn^ ■ rs > 


20(V^) .( ^ ). 

\m-\-rs/ \ r / 


From the calculations in Appendix it follows that for our choice of parameters, the ratio 
is at least exp(y7ilogn). So, we have the following theorem. 

Theorem 4.13. Let p, be an absolute constant such that 0 > ^ < 1 and ¥ he a field of 
characteristic zero. For 1 <i <T and 1 < j < d, if there exist polynomials Qij, each dependent 
on only s = variables, such that 


T d 

NWn,p. = 

i=l 

Then 

T-d> 

As a remark, we mention here that the lower bound above also holds for any translation 
NWn,p{X -I- a) of the polynomial NWn,fj.{X). This is because the highest degree term of 
NWn,p{X + a) equals the polynomial NWn,)i{X) and from Lemma [T5l the homogeneous com¬ 
ponents of a polynomial computable by small sized Eli (Eli) circuits also have small sized 
En(En)^'’^ circuits. We leave the details to the interested reader. 
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5 Application to polynomial identity testing 


In this section, we prove Theorem 11.31 We are interested in identity testing for Eli (Eli) 
circuits, i.e for polynomials in N variables {Xi, X 2 , ■ ■ ■, Xn} which can be expressed in the 
form 

P = ^ Qij 

i=i j=i 

such that 

1. The individual degree in P of every variable is at most k 

2. Each Qij depends on at most s variables 

For the case of this application, we will think oi k,T being polynomial in (logiV) and s being 
7 Vi/ 2 -e fQp a positive constant e. Observe that the bound on individual degree lets us upper 
bound the total degree of the polynomials hy Nk. 

We describe the construction of the hitting set in Section 15.21 and prove its correctness in 
Section 15.31 We go over some preliminaries that we need in our proof in the next section. 


5.1 Some preliminaries 

In the following lemma, we prove some properties of the model of En(En)^®^ circuits, which 
will be useful in the proof of the identity testing result. 

Lemma 5.1. Let ¥ be a field of eharacteristie zero. Let P he a non-zero polynomial in N 
variables and individual degree at most k over F, which is computed by a En(En)^*^ circuit C 
of top fan-in T and product fan-in d at level two, i.e P can be expressed as 

= X! n 

i=i j=i 


sueh that for each i G [T] and j G [d], Qij depends on at most s variables. Then, the following 
are true. 

1. For every variable y and integer 1 < j < k, can be computed by a circuit of the form 


d^P 

W 


T' d 


Enot 


where T' < T ■ {k -\- 1)^ and each of the polynomials QT depends on at most s variables. 
2. For any a S F^, P{X + a) can he computed hy a circuit of the form 


T d 


P{X + a)=Y,Y[Q'h 

i=l j=l 


where eaeh of the polynomials Q'h depends on at most s variables. 

Proof. The proof of the second item is immediate from the definitions. The only thing that 
changes due to a translation is the number of monomials in the Qij. The number of variables 
that each Qij depends on remains unchanged, and so does the fan-in of the top sum gate and 
the product gates at level two. 
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We now prove the first item. Let the set of variables in Phe X = X'VJ{y} where X' is of size 
N —1. Since the individual degree of P is at most k, we can write P = J2i=o Here, 

Ci{X') are polynomials only in the X' variables and are the coefficient of y*, when viewing P as 
an element of Now, for every 0 < i < k, we can compute each of Ci by a SII 

circuit with top fan-in at most T-{k + l) by interpolation as given by Lemma l3.4l All the partial 
derivatives of P with respect to y are linear combinations of the terms of the form Cj-^ ■ . And 

so, the result follows. □ 

We will also need the following simple fact about polynomials. 

Lemma 5.2. Let ¥ he a field of characteristic zero. Let R S F[?/] be a non-zero polynomial of 
degree at most t over the field F. Then, for every a S F such that R(a) = 0, there exists a j 
such that 0 < j < t — 1 and |^(a) = 0 and (a) 0 . 

Proof. Let the degree of i? in y be equal to t'. This means that the coefficient of highest degree 
term y* in R is non-zero. Let us call the coefficient of y* in R{y) as Ct'- We know that Ct' is 
nonzero. Consider j = t' — 1. The lemma immediately follows. □ 

We will crucially use the following result of Dvir, Shpilka, Yehudayoff [DSYOQj in the analysis 
of the hitting set constructed in this paper. 

Lemma 5.3 (Dvir, Shpilka, Yehudayoff |DSY09) 1. For a field F, let P G F[Ai, A 2 ,..., Xjv, Y] 
be a non-zero polynomial of degree at most k in Y. Let f G F[Ai, X 2 ,..., A^v] be a polynomial 
such that P{Xi,X 2 ,..., Xpf, /) = 0 and |^(0, 0,..., 0, /(O, 0,..., 0)) 7 ^ 0. Let 

k 

P = Y,C^{X,,X2,...,XN)■y^ 

i=0 

Then, for every t > 0, there exists a polynomial Rt G F[Zi, Z 2 ,..., Z^+i] of degree at most t 
such that 

Hom^*[/(Ai, A 2 ,..., A^)] = Hom^*[i?t(Co, Ci,..., C^)] 

A key technical idea in the proof will be the notion of Nisan-Wigderson designs introduced 
in [NW94) . We will use the following lemma. 

Lemma 5.4 (Nisan-Wigderson |NW94] 1. For every a, & S N, h < 2“, there exists a family of 
sets Si, S 2 ,. ■., Sb Q { 1 , 2,... ,1} such that 

1. lGO{af/\ogh) 

2. for all i, |S'i| = a 

3. for all i ^ j, 15^ fl Sj| < log 6 

Moreover, such a set family can be constructed in time polynomial in b and 2}. 

We will also use the following lemma of Alon [Alo99] very crucially in our proof. 

Lemma 5.5 (Combinatorial Nullstellensatz [Alo99) l. Let P be a non-zero polynomial of indi¬ 
vidual degree at most d in N variables over a large enough field F. Let S be an arbitrary subset 
o/F of size d -I- 1. Then, there exists a point p in such that P{p) 7 ^ 0. 
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5.2 Blackbox PIT for En(Sn)^^' circuits 

In this section, we prove the following theorem. 

Theorem 5.6. Let c and n he arbitrary constants such that c > 0 and 0 < fj. < 1/2, and let F 
be a field of characteristic zero. Let C be the set of polynomials P in N variables and individual 
degree at most k over F, with the property that P can be expressed as 

T d 

=X/ n 

i=i i=i 


such that 

1 . r<log°iV 

2. k < log'^ N 

3. d<N^ 

f. each Qij depends on at most variables 

Then, there exists a constant e < 1 dependent only on c and /i, such that there is a hitting set 
of size exp(A^*^) for C which can he constructed in time exp(A^'^). 

From our proof, it also follows that if each of polynomial Qij depends only on log*^^^^ N 
variables, then both the size of the hitting set and the time to construct it, are upper bounded 
by a quasipolynomial function in N. In the rest of the section, we prove Theorem l5.6l We start 
by describing the construction of the hitting set H. 

5.2.1 Construction of hitting sets for circuits for 0 < /i < 1/2 

Given fx such that 0</r<l/2, we pick the parameter fx' such that 0 < ^' < 1 and ^ is a 
positive constant strictly smaller than 1. We construct a family of Nisan-Wigderson designs as 
described in Lemma 15.41 with the following parameters : 

1. b, the number of sets is set equal to N 

M 1 

2. a, the size of each of the sets Si is set equal to NP logi^ N. 

3. I, the size of the universe is chosen large enough in order to satisfy the hypothesis of 
Lemma 15.41 From Lemma 15.41 it follows that we can pick I which is not too large {I G 
0{a^/ logb)). For the above chosen values of a,b, there is a choice of I such that I is at 
most logl^”^ N. 

Recall that our goal is to construct a hitting set for ^ circuits. Observe that the 

choice of parameters l,a,b satisfy the hypothesis of Lemma EH So, we get a collection of N 
subsets Si,S 2 , ■ ■ ■, Sn of {1, 2,3,... ,1} satisfying 

1 . for all 1 < i l^il = a 

2 . for all 1 < i |S'i O < logA^ 

Moreover, these sets can be constructed in time polynomial in b and 2/ We identify the set 
{1,2,3,...,!} with the set of new variables Y = {¥±,¥ 2 ,..., Yi}. Before we proceed further, we 
need some notation. We will pick <5 = (1 — pi')/2 to be a non-negative constant. Given, a,fx',S, 
we define 7 = Then, we define q to be the smallest prime number between (a/2)^ 

l+T' 1 

and 2 • (a/ 2 ) ^+~i. Also, we set a' to be equal to (a/ 2 ) ‘^+ 1 . Observe that a /2 < a'q < a. 
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For each *, such that 1 < i < iV, let S/ be an arbitrary subset of Si of size equal to a'q. For 
brevity, we rename the sets 5' as Si [3. Let p = (/r' + 6) f and D = ^■ 

Often for the ease of notation we will identify the set Si of {1,2,...,!} with the set of 
variables {Yj : j S Si}. We will think of the variables {Yj : j G Si} to be arranged in a a' x 9 
matrix V (z), with the variables placed in the matrix in some order. For every i G {1, 2, 3,..., TV}, 
we define NWa',ii'{Si) as 


NWa\ii.'{Si) — 

fiz)e¥g[z] 

deg(f)<D-l 

For a point p = {pi,p 2 , ■ ■ ■ ,pi) G F^, we denote by NWa\fi'{Si)\p, the evaluation of NWa',fj.'{Si) 
when the variable Yj is set to pj. 

Let G be an arbitrary subset of F of size Nka' + 1. We define the hitting set T-L as follows. 
Definition 5.7 (Definition of the hitting set T-L). 

Ti= [{NWa',i,'{Si)\p,NWa',,,'[S2)\p....,NWa',g'[SN)\p).p€G^] 

We now proceed to prove the correctness of the construction. We first prove the following 
lemma which shows that Ti is explicit and has the correct size as per Theorem 15.61 

Lemma 5.8. The set Ti as defined in Definition \5. 7] has size at most [Nka' + 1)* and all its 
elements can be enumerated in time a“ • {Nka' + 1)* • . 

Proof. The size of the set Ti is equal to |G|* = {Nka' + 1)^ The set Ti can be enumerated 
by enumerating through the points p in G* in some natural order (say lexicographic order) and 
evaluating the tuple {NWa\ij,'{Si)\p, NWa\fj,'{S 2 )\p, ■. ■, NWa',fj.'{SN)\p) at each of these points. 

For every point p and subset Si, the polynomial NWa'^ii'{Si) can be evaluated in time at most 
a“ X Poly(A^) from Lemma [4.21 So, the second part of the lemma follows. □ 

Observe that for our choice of parameters, the above bounds on the size and the time of 
enumeration are bounded by a function which is subexponential in N. 

We now show that for every non-zero polynomial P in the class C, as defined in the statement 
of Theorem 15.61 there exists a point p € Ti, such that P{p) is non-zero. We show this in 
Lemma 15.91 below. That will complete the proof of Theorem 15.61 

5.3 Correctness of the construction 

For the rest of this section, we denote N'' by s. 

Lemma 5.9. Let P be a non-zero polynomial in the set C as defined in the statement of 
Theorem \5.6\. and let Ti be the set defined in Definition \5.7\ Then, there is a point p in the set 
Ti such that P{p) 7 ^ 0. 

Proof. We define 

P^{X,Y) := P{NWa',i.>{Sl),NWa>,g'{S2), . . . ,NWa',g'{S,),X,+ i,X,+2, ...^Xn) 

to be the polynomial obtained from P by substituting the variables Xj by NWa',fj.'{Sj), for 
every 1 < j < z. 

®We have replaced the family {S'!, S 2 , • • •, Sn} by the set family {SJ, S' 2 ,..., S(v} such that for each i G [N], 
S'i C Si. Observe that the design based properties of the original system continue to hold. The only thing that 
changes is that the size of S'i could be smaller than the size of Si, by at most a factor 2. 


21 





From the construction of our hitting set, it follows that it would suffice to argue that the 
polynomial Pn{X,Y) is non-zero. If this was true, then the lemma above will follow from 
Lemma ISTSl since the degree of any variable P{X,Y) is at most Nka'. 

We proceed via contradiction. If possible, let Pn{X,Y) be identically zero. Since P = 
Po{X, Y) is non-zero to start with, by a hybrid argument it follows that there is an index i, such 
that Pi{X,Y) is non-zero while Pi+i{X,Y) is identically zero. Observe that Pi is a polynomial 
in the variables Y and Xi^i, Xi+ 2 , ■ ■ ■, Xiq. In going from Pi to Fi+i, we substituted the variable 
Xi^i by the polynomial NWa',)j.>{Si+i). Since Pi{X,Y) is non-zero by assumption above, there 
exists a substitution c of all variables apart from {Yj : j S iS'i+i} and which keeps the 

polynomial non-zero. Let the polynomial resulting after this substitution be P'. From the 
definitions, it follows that 


P' = P{NWa',i,'{Si)\c, NWa',i.'{S2)\c, . . . , TV ) IO ^^+l, ^*+2 | C, ■ • ■ ,^jv|c) 


Observe that each of the polynomials NWa',fj.'{Sj)\c depends only on the variables in the 
set Sj n iSi+i. From the properties of Nisan-Wigderson designs, and the choice of parameters, 
the size of this intersection is at most log TV. From the definition of Pi and the choice of c, P/ 
is not identically zero. We will think of P/ as a polynomial in TV^+i with the coefficients being 
polynomials in the variables in the set {Yj : j G Si+i}. Now, we know that the the polynomial 
P /+1 obtained by substituting Xi+i by NWa\i_i'{Si+i) is identically zero. Hence, it must be the 
case that Xi+i — NWa',^i'{Sij^i) is a factor of P/. 

To proceed further, we need the following claim. 

Claim 5.10. P/ as defined above can be represented as 




r=l j=l 


such that each of the polynomials QP depends on at most s log TV variables. 
Proof. Recall that P can be represented as 


T d 


P — ^ Qij 

i=i j=i 


where each Qij is a polynomial in at most s = variables. In going from P to P/, we have 
substituted each of the variables outside the set {Yj : j G S'i+i} U {Xi+i} by either a constant or 
by the polynomial NWa',ii'{Sj)\c (which is a polynomial in at most < log TV variables) 

for some j. In either case, after substitution, the polynomials QP obtained from Qrj depends 
on at most slog TV variables, since Qrj depended on at most s variables. This completes the 
proof of the claim. □ 


Moreover, since the individual degree of variables in P is at most k, the individual degree 
of Xij-i in P/ is at most k. The goal now is to invoke Lemma 15.31 which would imply that 
NWa',iJ,'iSi+i) also has a small circuit as a sum of product of polynomials in few variables, 
and together with the lower bound from Theorem 14.131 this would lead to a contradiction.We 
essentially follow this outline. Formally, we use the following claim to complete the proof of 
Lemma ICTl We defer the proof of the claim to the end. 

Claim 5.11. //— TVWay^'(5'i+i)) divides P', then TVWay^'(5'i+i) can be written as 

/' d' 

Enr. 

r=l j=l 
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where 


1. r < {da'"^ + 1) • X (r.(fc+j_)3+ay+i 

2. d' < d ■ a' 

3. Each Trj depends on at most s log N variables 
From our choice of parameters, recall that 

a = ■ log^/^' N 


and 

s = N>^ 

Therefore, slogiV < ■ \ogN < . To complete the proof, we observe that by Theo¬ 

rem Km we must have 

I'd' > 

But, for our choice of parameters, 

1. r < {da'"^ + 1) • X ^ (iaC’(Tfe^) < (since a and a' are 

polynomially related) 

2 . d' < da' 

This implies that I'd' < \ From our choice of parameters, slogiV < and 

Tk'^ + 21ogd S o(\/a'). This contradicts that I'd' > This completes the proof of 

Lemma 1531 assuming Claim [5TT1 □ 

We now give a proof of Claim 15.111 

Proof of Claim 15.111 From Claim 15.101 we know that 

r=l j=l 

such that each depends on at most slogA^ variables. Since P' is not identically zero and 
NWa',ii'{Si+i) is a root of P/, it follows from Lemma lOl that there is an integer A such that 
0 < A < fc — 1 and, 

^^i+1 

and 

OA-I-I p! 

~ p' 

From Lemma 15.11 it follows that P' = can also be expressed as 

PI ~ ^ n 

r=l j=l 

where T' < T ■ {k + 1)^ and each of the Qrj depends on at most s log IV variables. 

Observe that, P' vanishes when NWa',fj.'{Si+i) is substituted for W-i-i, while its derivative 
with respect to W-i-i does not vanish identically at W-i-i = NWa>,fj,'{Si+i). So, in particular, 

dP' . 

there is a substitution of the Y variables where the derivative av ‘ is nonzero. Since the class 
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of En(I]n)^'’^ circuits is closed under translations of variables (from item 2 in Lemma 15. II) . we 
can assume without loss of generality that the derivative is nonzero when all the variables in 
V are set to zero. Also observe that by this variable translation, we have actually obtained a 
polynomial NW^,^^,(Si+i) from NWa\fi'(Si+i). Moreover, the degree of NW^,^^,(Si+i) is equal 
to a' and the homogeneous component of degree a' of NW ^^,is equal to NWa',n'{Si+i). 
Let the polynomial obtained after the variable translation from P/ as P/'. At this point, the 
hypothesis of Lemma [531 is satisfied by P". 


Let P" = ■ ^i+i- Here, Cj{Y) is a polynomial only in the Y variables and is the 

coefficient of when viewing P" as an element of F[y][Ai+i]. From Lemma 13.41 we know 

that each of the polynomials Cj can be expressed as a polynomial of the form 




r=l 1=1 


where Tj < T' ■ {k + 1) < T ■ {k + 1)^ and each Q"; depends on at most s log N variables. 

Hence, by Lemma lOl for every f > 0, there exists a polynomial P* S F[Zi, Z 2 ,..., Zk+i] of 
degree at most t such that 

The goal now is to obtain a representation of NWa',fi'{Si+i) as a sum of products of poly¬ 
nomials in few variables and show that this contradicts the lower bound in Theorem 14.131 
7VVlP,_^,(S'i+i) is a polynomial of degree at most a'. So, there is a polynomial Ra' of degree at 
most a' in A: -f 1 variables such that 

From the discussion on the relation between NW^^, ^,{Si+i) from NWa',f_i'{Si+i), we also know 
that 

= Hom“'[iVlF^,_^,(P,+i)] = Hom“'[P,,(Co, Ci,..., C^)] 

Since Ra' is a polynomial in fc -I- 1 variables of degree a', the number of monomials in Ra' is 
at most (“ Therefore, we can represent Ra'{Co, Ci,..., Ck) as a sum of products of the 

Cj’s, with the sum fan-in at most (“ and the product fan-in at most a'. Moreover, each 

of the product gates in this representation takes the polynomials Cj’s as inputs. We know that 
each Cj can be written as 

Tj d 

r=l 1=1 

where each Q”^ is a polynomial in at most s log N variables, and the top sum fan-in Tj is at most 
T ■ (k + 1)^. For any t, the polynomial Cj, has a similar representation with the top sum fan-in 

at most ’*'*). Therefore, any product of fan-in at most a' in the C^’s can be written as 

a sum of product of polynomials in at most s log N variables, with top fan-in at most 

T-(/c-kl)3-hr'^ 


since each Cj is raised to a power of at most a' and there are fc -|- 1 such Cj’s. Therefore, 
Pq'(Co, Cl,..., Cfc) can be written as 

I d’ 

p„,(co,ci,...,Cfc)=^nrr, 

r=l j=l 


such that 
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1. I < (%+') X (T-(fc+^l/+ay+l 

2 . d! <d - a' 

3. Each PE depends on at most slogA^ variables 

We would now like to extract the homogeneous part of degree a' of Ra'{Co, Ci,..., Cfe), which 
we know is equal to NWa',fj.'(Si+i). We do this by a standard application of Lemma 1531 Since 
we are interested only in the homogeneous part of degree a', we can assume without loss of 
generality that each of the polynomials TR is of degree at most a' (we can discard all monomials 
of degree larger than a' in each of the since they do not contribute to the homogeneous 
component of degree a' of Ra'{Co, Ci,..., Ck) )■ Hence, the degree of Ra'{Co, Ci,..., Ck) is 
upper bounded by da' ■ a'. So, from Lemma Td. 5 1 we can extract the homogeneous component 
of degree a' of Ra' (Co, Ci,..., Cfc) by blowing up the top fan-in by a factor of at most -I- 1. 
Hence, NWa',fi'{Si+i) can be expressed as 

/' d' 

NWa',t,'{S,+ i) = Enr. 

r=l j=l 

where 

1 . r < {da'"^ + 1) • ^ 

2 . d' <d - a' 

3. Each Trj depends on at most s log N variables 

□ 

We remark that if the value of s was log*^*-^^ N to start with, the same proof as above goes 
through with I and a being set to polynomials of sufficiently high degree in logA^. The size of 
the hitting set and the time to construct it in this case are upper bounded by a quasipolynomial 
function in N. 


6 Open problems 

We conclude with some open problems. 

1. An intriguing open qnestion is to obtain PIT for EH (EH)^*^ circuits without the restriction 
on the individual degree. The strategy in this paper relies on hardness randomness tradeoffs 
for bounded depth circuits |DSY09] . The tradeoffs in |DSY09] crucially use the fact that 
the individual degree is bounded. 

2. Another related question would be to get any non-trivial PIT (even subexponential) for 
the sum of constant many products of degree two polynomials. 

3. It would also be interesting to understand if one could obtain any non-trivial PIT for 
slightly non-multilinear depth four circuits (say individual degree at most 2 ) with bounded 
top fan-in. A natural strategy for this question would be to reduce it to the case of 
En(En)^®^ circuits by either expanding out the polynomials Qtj which depend on too 
many variables or use a partial derivative like trick, as in [dOSVId] . The immediate 
challenge in this case is that the top fan-in seems to increase by any of these tricks and 
the calculations in this paper seem to not work out. 
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( ^ ) 
Vm+n—r/ 


> 


(m + rs)! (N — m — rsy. 


( + ^ (m + n — r)\ {N — m — {n — r))\ \e{n + r) 


Here 


we use the fact that • Now, approximating the ratios using Lemma [331 

-f(l- 

( ^ ) 

\m+n—r/ 


and substituting m = ^(1 ~ n )’ 
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N- 


/ Af \ (n+r\ — 

Vm+rs/ \ r / 

> exp 


n—r—rs 
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r Inn 
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i{n + r) 


(n — r — rs) — r In 


3 (n + r) 


Since r = Q{y/n), we get that the ratio is at least exp (r lnn((n — r — rs)/n — 5 + o(l))), 
which is exp(H(\/nlnn)). 


Next we estimate the ratio 


(f ^ m ■ ffl) 

/ N \ _ ^n+r\ 
Vm+rs/ \ r } 


(f ■ (b ■ o) 

\m+rs/ \ r / 
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(m + rs)! {N — m — rs)\ N\ 

'¥ ' " 


n! 


i! (N — my. (N — r)! (n + r)! 


P 


4r I TV _ 


, /AT 

m \ / N 


A „ „ Inn 
> — • 1 - 2 . 01 r- 

d*" V n 


> — exp (—r(fj, + S) In N — 2.01r^s —— + r \n{N/n) 
d*" V n 


Here, we used Lemma [+3 in the second step and substituted p = N 1'^+^) in the last step. Now, 
substituting 2n^+'>' > N > n^’*"'’', the exponent is at least 

r lnn(—(p, + i5)(2 + 7 ) — 2.01rs/n + (1 + 7 )) 


This is at least 

r lnn(—(/i + S){2 + 7 ) — 2.01rs/n + (1 + 7 )) 

Now, plugging back the value of 7 , the exponent is at least (2 — 2.01rs/n)rInn. We have 
chosen rs such that rs/n < 0.001. Therefore, the ratio we set out to lower bound is at least 
exp(H(-ynlnn)). 
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